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Abstract 

Background: Telomeres, the protective cap of chromosomes, have emerged as powerful markers of biological age 
and life history in model and non-model species. The qPCR method for telomere length estimation is one of the 
most common methods for telomere length estimation, but has received recent critique for being too error-prone 
and yielding unreliable results. This critique coincides with an increasing awareness of the potentials and limitations 
of the qPCR technique in general and the proposal of a general set of guidelines (MIQE) for standardization of 
experimental, analytical, and reporting steps of qPCR. In order to evaluate the utility of the qPCR method for 
telomere length estimation in non-model species, we carried out four different qPCR assays directed at humpback 
whale telomeres, and subsequently performed a rigorous quality control to evaluate the performance of each assay. 

Results: Performance differed substantially among assays and only one assay was found useful for telomere length 
estimation in humpback whales. The most notable factors causing these inter-assay differences were primer design and 
choice of using singleplex or multiplex assays. Inferred amplification efficiencies differed by up to 40% depending on assay 
and quantification method, however this variation only affected telomere length estimates in the worst performing assays. 

Conclusion: Our results suggest that seemingly well performing qPCR assays may contain biases that will only be detected 
by extensive quality control. Moreover, we show that the qPCR method for telomere length estimation can be highly 
precise and accurate, and thus suitable for telomere measurement in non-model species, if effort is devoted to 
optimization at all experimental and analytical steps. We conclude by highlighting a set of quality controls which may serve 
for further standardization of the qPCR method for telomere length estimation, and discuss some of the factors that may 
cause variation in qPCR experiments. 
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Background 

Telomeres play a key role in maintaining chromosome in- 
tegrity and are crucial for normal cell function [1]. In verte- 
brates, telomeres consist of tandem repeated TTAGGG 
sequences at the end of chromosomes [2], Telomere 
repeats are lost during cell replication and by oxidative 
damage [3-5], and telomeres thus tend to shorten with age, 
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ultimately reaching a threshold which likely contributes to 
cellular and organismal senescence [6-10]. 

The predicted loss of telomere repeats with age, and 
observations that the rate at which this happens correl- 
ate with fitness, lifespan, and susceptibility to a range 
of diseases, have sparked a general interest in under- 
standing the role of telomere dynamics in life histor- 
ies [11-20], as well as using telomeres as a molecular 
tool for determination of chronological and biological 
age in non-model species [21-23]. 

One approach to telomere length estimation is the 
qPCR method developed by Cawthon [24,25]. Target- 
ing telomere repeat sequences in a qPCR assay is 
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possible by the use of specially designed primers with 
build-in mismatches that allow for binding and ampli- 
fication of the telomere target, but not amplification of 
primer dimers [24,25]. In qPCR, a DNA-binding fluor- 
escence-dye such as SYBR green is used to monitor 
the amplification in individual PGR reactions and de- 
termine the point in time during cycling, the Cq value, 
when amplification of target crosses a fixed threshold. 
The resulting sample Cq values can be translated into 
estimates of the amount of telomere repeats (T) in a 
sample by means of a standard curve under the as- 
sumption that the amount of fluorescence directly cor- 
relate with the amount of double-stranded DNA that 
is amplified. This T can be scaled against the amount 
of a single copy reference gene (S) to obtain the T/S 
ratio, which is an estimate of the relative amount of 
telomere repeats in the sample. 

The general speed, sensitivity, and conceptual and 
practical simplicity of the PGR technique have resulted 
in qPCR becoming the touchstone for nucleic acid quan- 
tification and comparison in several disciplines [26,27], 
and likewise, the telomere qPCR approach to telomere 
length estimation have become one of the most common 
methods for studying telomere dynamics. 

However, despite its conceptual transparency and 
practical simplicity, obtaining, analyzing, and interpret- 
ing qPCR data is not a trivial issue. In particular, the 
high sensitivity of the technique implies that results 
may be of low precision and/or misleading if the qPCR 
assay is not adequately optimized. These problems have 
been exacerbated by the wide applicability and popular- 
ity of the technique along with a lack of general guide- 
lines for how to perform and report qPCR experiments. 
As a consequence "qPCR has become an inadequately 
standardized, complex, and frequently, inconsistent tech- 
nique that invites the publication of flawed conclusions 
[28]. A similar critique has been directed towards 
the qPCR approach to telomere measurement and there 
is a growing debate about its research applications 
[29-33]. 

Still, there are several reasons why telomere estimation 
by the qPCR method may be attractive. First, many of 
the above issues are not inherent characteristics of the 
qPCR technique but mainly result from unfamiliarity 
with its technical requirements and limitations. This 
issue has been addressed by the recent formulation of 
the MIQE guidelines (Minimum Information for publi- 
cation of Quantitative real-time PGR Experiments) with 
the purpose to ''ensure the integrity of the scientific lit- 
erature, promote consistency between laboratories, and 
increase experimental transparency' [26-28]. Second, 
scientific progress is made by developing, testing and 
optimizing, rather than just criticizing, as also noted by 
Monaghan [29] and Smith and co-authors [33]. That is. 



estimation of measurement precision and accuracy may 
provide the information required to identif)^ and reduce 
the factors causing variability, and hence optimize the 
qPGR method. Finally, until recently, the qPGR method 
provided the only avenue for measuring telomere length 
in skin biopsies from free-ranging and generally in- 
accessible species for which even little information on 
telomere dynamics, however preliminary, is of value. 

Here, we evaluate the performance of four assays 
based on the qPGR methods described by Gawthon 
[24,25] . The four assays differed with respect to primers, 
reagents, qPGR platform, and experimental setup, and 
were all modified specifically for estimation of telomere 
lengths in humpback whale skin samples. Our goals 
were to; 

i. Arrive at a reliable qPGR assay specifically tailored 
for measuring telomeres in humpback whale skin 
samples 

ii. Build on the MIQE guidelines and work of Karlen 
and co-authors [34] to propose a set of quality 
controls which may serve for further standardization 
of the qPGR method 

iii. Highlight a subset of the factors affecting the 
precision and accuracy of telomere length estimates 
by qPGR; and, if nothing else 

iv. Prevent others from making the same mistakes as 
we did by reporting all relevant aspects of our 
workflow and results. 

Our approach is unique in its use of skin samples from 
a large, free-ranging marine mammal and its comparison 
of several different qPGR assays. However, although the 
focus is that of telomere estimation in humpback whale 
skin samples, the factors causing variability and the prin- 
ciples of quality control are universal and should apply 
to many qPGR applications. 

Methods 

Sampling and DNA extraction 

Skin samples were obtained from the Gulf of Maine hump- 
back whale population by use of biopsy techniques [35,36] 
and subsequently stored at -20°G in 10% DMSO. Genomic 
DNA was extracted using a modified phenol-chloroform 
method [37] or the QIAGEN™ DNeasy Blood and Tissue 
kit according to the manufacturer s instructions and stored 
in TE buffer (10 mM Tris Gl, 0.5 mM EDTA, pH 9.0) at 
-20°G. The DNA concentration of each extraction was 
measured using a Thermo Scientific NanoDrop 8000 and 
DNA quality assessed by gel-electrophoresis [37]. 

Experimental design 

Each qPGR batch (i.e. run or plate) consisted of three 
components; i) a serial dilution series prepared from a 
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pool of DNA from six humpback whales, which was used 
to construct standard curves and formed the basis of 
the primer optimization and quality control (Table 1); ii) 
"unknown" humpback whale samples for which we mea- 
sured telomere lengths and tested the performance of dif- 
ferent quantitative methods; and iii) no template controls 
(NTC), which allowed for detection of potential contam- 
ination and/or primer dimer formation. Each step of the 
dilution series, each unknown sample, and the NTC were 
run in triplicate in each batch, and the same "unknowns" 
analyzed in each of the four different qPCR assays. 

Primer optimization 

Primers were obtained from the literature [24,25,38] or 
designed using the AmplifX program [39]. Testing and 
optimization was performed in a stepwise approach in 
which we first performed conventional PCRs [40] to assess 
whether the amplicon was of the expected length, then 
Sanger sequenced [41] the amplicon to confirm primer 
specificity, and finally performed a series of qPCR reac- 
tions on each primer pair, in which primer concentrations, 
annealing temperatures, and template DNA concentra- 
tions were kept as variables. In each of these reactions, 
melting curves allowed for assessment of primer specifi- 
city [42]. The conventional PGR conditions varied across 
primer pairs, but generally consisted of initial heating at 
94°C for 2 minutes, followed by 30-40 cycles of denatur- 
ation at 94°C for 30 seconds, annealing for 30 seconds, 
and extension at 72°C for 30-240 seconds, and comple- 
tion at 72°C for 10 minutes. PGR products were separated 
by electrophoresis through 1.7% agarose gels (FMC, Inc.) 
in IxTBE buffer [37]. Sequencing was performed using 
the ABI BigDye™ Terminator v3.1 cycle sequencing kit 
and an ABI 3130 genetic analyzer according to the manu- 
facturers protocol (Applied Biosystems Inc.). Quantitative 
PGR reactions were performed as detailed below. 

qPCR reactions 

The first assay. Assay I, was a singleplex assay similar 
to that of Cawthon [43] except for using a commercial 
SYBR green master mix. PGR reaction volumes were 
25 [A and consisted of 50% ABsolute™ QPGR SYBR® 
Green Mix (Thermo Fisher Scientific, Inc.), 0.200 \iM 



ROX dye, 2-12 ng template DNA, and 0.625 \iM of each 
primer as listed in (Additional file 1: Table SI). PGR 
amplifications were conducted using an ABI PRISM™ 
7000 Sequence Detection System using the MicroAmp™ 
optical 96-well reaction plates with optical 8-cap strips 
(Applied Biosystems, Inc.). The thermal cycling profile 
was 95°G for 15 min followed by 40 cycles of 95°G for 
15 s and 56°G annealing, extension for 1 min, and con- 
cluded with a dissociation profile for construction of 
melting curves. The PGR conditions for the a-lactalbumin 
gene were identical to those of the telomere amplification, 
except that the annealing and extension temperature was 
set at 58°G. 

In assay II, we tested the multiplex approach presented 
by Gawthon [24]. Here, telomere primers have been 
designed to amplify a fixed-length PGR product with 
well-defined melting profile useful for subsequent evalu- 
ation, and the reference gene primers contain a GG-clamp 
which increases their melting temperature, allowing for 
separate amplification of telomere and reference gene. 
Reactions had a volume of 25 ul and contained 0.75 SYBR 
Green I (Invitrogen), 10 mM Tris-HGl (pH8.3), 50 mM 
KGl, 3 mM MgGl2, 0.2 mM of each dNTP, 1 mM DTT, 
0.625U of AmpliTaqGold, 1 M betaine (U.S. Biochemicals), 
and 20 ng DNA. Telomere primers and the albumin refer- 
ence gene primers were as described by Gawthon [24] with 
the exception that we used 500 nM of each telomere 
primer and 1300 nM of each albumin reference gene pri- 
mer (Additional file 1: Table SI). Amplifications were per- 
formed in a QIAGEN Rotor-Gene Q (former Gorbett 
Rotor-Gene 6000) qPGR cycler using the manufacturers 
0.1 ml strip tubes and caps. Gonditions were 95°G for 
15 min, 2 cycles of 94°G for 15 s and 49°G for 15 s, 40 cycles 
of 94°G for 15 s, 62°G for 10 s, 74°G for 15 s with signal 
acquisition, 84°G for 10 s, 88°G for 15 s with signal 
acquisition, and concluded with a melting curve ramping 
from 72°G to 95°G, rising by 0.5°G in steps of 30 s. 

Assay III was a repetition of assay II, but with telomere 
and reference amplified in separate singleplex reactions to 
assess potential reaction inhibition caused by multiplexing. 
That is, the telomere portion of the assay only received 
telomere primers, and the reference gene portion only 
received albumin primers. The telomere was amplified 



Table 1 Experimental setup of the four qPCR assays for relative telomere length estimation 



Reaction conditions 


Master mix 


Duration 


Primers 




Standards 






Telomere 


Reference 


Assay Platform 


Type 


Tel 


Ref 


Batches 


Dilutions 


LDR 


Cq 


NTC 


Cq NTC 


1 ABI 


Singleplex 


Commercial 


21 


CW2006 


BA 1998 


7 


5 


4.5-72.0 


17.6 


34.9 


33.5 40.0 


II RG 


Multiplex 


RC 2009 


6 


RC 2009 


RC 2009 


4 


4 


2.2-60.0 


15.7 


19.4 


39.9 27.9 


III RG 


Singleplex 


RC 2009 


5 


RC 2009 


RC 2009 


4 


4 


2.2-60.0 


14.1 


15.5 


37.4 36.2 


IV RG 


Multiplex 


Commercial 


6 


RC 2009 


RC 2009 


4 


5 


4.5-72.0 


13.9 


27.2 


34.3 31.1 



Duration = number of days it took to generate tine data once assays had been optimized; BA = Berube and Aguilar [38]; CW 2006 = Callicot and Womack [52]; RC 
2009 = Cawthon [24]; Batches = number of separate qPCR runs (plates); Dilutions = number of dilution steps; LDR = linear dynamic range (ng/reaction); Cq = the 
average Cq of the serially diluted standard with the lowest amount of DNA; NTC = the lowest Cq value observed for a NTC. 
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using 95°C for 15 min, 2 cycles of 94°C for 15 s and 49°C 
for 15 s, 40 cycles of 94°C for 15 s, 62°C for 10 s, 74°C for 
15 s with signal acquisition, and the reference gene reac- 
tion conditions were 95°C for 15 min, 2 cycles of 94°C for 
15 s and 49°C for 15 s, 40 cycles of 94°C for 15 s, 84°C for 
10 s, and 88°C for 15 s with signal acquisition. 

Finally, assay IV was similar to assay II, but using a com- 
mercial SYBR green master mix rather than Cawthons' 
master mix protocol Each qPCR reaction was conducted 
in a total volume of 25 [A consisting of 50% ABsolute™ 
QPCR SYBR® Green Mix Plus ROX vial (Thermo Fisher 
Scientific, Inc.), 800 nM of each primer, and 20 ng of tem- 
plate DNA. Consumables, PCR platform, and settings 
were as in assay II. 

Processing of raw fluorescence data 

Initial visual examination of the amplification curves 
were done in the ABI Prism 7000 SDS Software version 
1.2.3 for assay I and the Rotor-Gene 6000 Series Soft- 
ware version 1.7.87 for assays II-IV. Baseline correction 
was performed in LinRegPCR version 12.16 [44-46] un- 
less otherwise noted. We used the automatic "strict" 
baseline correction option to adjust for background 
fluorescence noise, but made manual adjustments based 
on the minimum level of fluorescence in the few 
instances where the program could not set the baseline 
automatically. The fluorescence threshold line for deter- 
mination of Cq values was fixed among batches and set 
to cross the linear phase of the amplification curve dur- 
ing which amplification is exponential [47]. The linear 
phase, or window of linearity (W-o-L) in the LinRegPCR 
terminology, was initially set for all reactions in com- 
mon, but individually adjusted if required to obtain a 
better fit (i.e. a higher R^) of the regression line to the 
linear phase of the amplification curve. 

Serial dilution series 

Efficiency and quantitative methods 

The observed amount of telomere and reference gene in 
each dilution step of the serial dilution series was deter- 
mined by the methods described by Pfaffl [48] and Ruijter 
and co-authors [46] (Equations 1-7 in the Appendix). The 
two methods differ in the way the amplification efficiency 
E is estimated. In the PfafQ method, efficiency is estimated 
from the slope of the standard curve (i.e. calibration 
curve), which describes the linear regression between the 
log amount DNA of the serial dilution series and the Cq 
values observed at each dilution (Additional file 2: Figure 
SI A, B). In the Ruijter method, amplification efficiency is 
estimated from the slope of the regression line fitted 
to the log linear phase of each amplification curve as 
implemented in the LinRegPCR software. The telomere 
and reference gene amplification efficiencies were 
obtained for each batch as an average of those individual 



amplification efficiencies that did not deviate by more 
than 5% from the average, as described in Ruijter and 
co-authors [46]. In addition, we estimated the average 
efficiency of the triplicate reactions to assess whether 
efficiency varied across the range of the dilution series. 

Similar to the Pfaffl method, a "standard curve" can be 
produced for the Ruijter method as the linear regression 
between the log amount DNA of the serial dilution 
series and the starting concentration NO observed at 
each dilution step. The slope and intercept of these 
standard curves allow us to estimate the observed 
amount of telomere and reference gene [47] (Equation 6 
in the Appendix). 

Precision, robustness, accuracy, and resolution 

Precision refers to the degree of variability in an esti- 
mate, and thus reflects the consistency (repeatability) of 
the results generated by a given assay. Robustness refers 
to the variation across the batches (i.e. runs or plates) 
performed for a given assay, whereas accuracy is the 
assays ability to produce results that are identical to the 
expected (true) values. For each assay, robustness was 
quantified in terms of the average coefficient of variation 
(CV) of observed Log DNA amounts across batches. 
Precision was quantified in terms of the standard devi- 
ation of triplicate Cq values, as well as in terms of the 
R^ value associated with the linear regression between 
the observed and expected Log DNA amount of the ser- 
ial dilution series. In addition, the slope of the linear re- 
gression line allows for assessment of an assays' accuracy 
in that a highly accurate assays have a slope equal to 
one. For example, in Figure 1, the depicted reference 
gene batches from assay III are highly accurate (slope = 1), 
but differ in their precision with the observed amount of 
DNA being less variable and hence more precise in batch 
A (R^ = 0.9963) relative to in batch B (R^ = 0.9536). 

The resolution of an assay reflects its ability to dis- 
criminate between consecutive steps of the dilution 
series and consequently depends on the precision and 
accuracy of an assay. We assessed the resolution of each 
assay by using a pairwise T-test to evaluate whether the 
average observed Log DNA amounts estimated for each 
consecutive step in the dilution series were significantly 
different from each other, as would be expected for 
assays with sufficient resolution. 

Humpback whale telomere length estimation 
Outlier detection 

Outliers are data points that can be disregarded as irregu- 
lar observations because they do not follow the distribu- 
tion of the rest of the data [49]. In the 60 "unknown" 
humpback whale samples we defined Cq values as outliers 
if they were outside the linear dynamic range of the stand- 
ard curve or if the estimated Grubbs' [50] G -statistic for 
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Figure 1 Precision and accuracy illustrated by two reference 
gene batchies from assay III where the expected input amount 
of DNA is plotted against the observed amount of DNA. The 

solid line is the best fit through the observed amount DNA (circles) 
and the stippled line the expected fit when y = x. 



Cq values or relative telomere length was lower than 1% 
(Equation 8 in the Appendix). 

Telomere length estimates across quantitative methods and 
assays 

For each assay, the telomere length of the "unknowns" 
was estimated using four quantitative approaches, differ- 
ing with respect to their estimation of amplification effi- 
ciency, their inter-batch normalization procedure, and 
how the raw fluorescence data is baseline corrected. 

That is, in addition to the PfafQ and Ruijter methods 
described above, telomere length was also estimated by 
the PfafQ method where baseline correction was per- 
formed in the ABI PRISM and Rotor-Gene software 
rather than LinRegPCR, as well as by the comparative Cq 
method [51]. In the comparative Cq method, efficiency is 
assumed to be constant and similar for both the telomere 
and reference gene reaction, and inter-batch variation is 
adjusted by normalization to the Cq value of a calibrator 
sample included in each batch (Equations 1-4). 

In addition to individual telomere lengths, we esti- 
mated the difference between the minimum and max- 
imum telomere length (dMinMax), the average standard 



deviation of all telomere length estimates (SD), and the 
ratio between these two measures, which we take to re- 
flect the degree of resolution of the assay. 

Finally, the telomere length estimates obtained with 
different quantitative methods were compared under the 
rationale that the choice of quantitative method should 
have minimal effect on telomere length estimates if the 
assay performs well, that is, large variation in telomere 
length estimates from a given assay may indicate that 
the performance of this assay is suboptimal. 

Results 

Primer optimization 

Six of the initial reference gene primer pairs were dis- 
carded because the failed to amplify, targeted multiple 
amplicons, or did not amplify consistently across the 
range of the dilution series (Additional file 1: Table SI). 
The 36B4 reference gene primer pair passed the above 
tests, but was discarded because Sanger sequencing did 
not provide a satisfactory target sequence. The remaining 
four primer pairs (two telomere and two reference genes) 
amplified with adequate specificity and consistency. The 
linear dynamic range of these primer pairs was approxi- 
mately similar at 4.5-72.0 ng/reaction for assays I and IV 
and 2.2-60.0 ng/reaction for assays II and III, with amplifi- 
cation efficiencies decreasing at higher DNA amounts 
(Table 1, Additional file 2: Figure SI). 

Serial dilution series 

Non template controls (NTCs) 

NTCs are used to detect PCR contamination and to dis- 
tinguish unintended amplification products such as pri- 
mer dimers in SYBR green reactions. In the telomere 
reactions of assay I and IV, the NTC crossed the amplifi- 
cation threshold 10 cycles or more after the most diluted 
standard DNA (Table 1, Additional file 3: Figure S2A). In 
assay II and III, the difference in Cq values between the 
most diluted standard and the NTC was less than 5 cycles. 
In the reference gene reactions of assay I, the NTCs had 
Cq values of 40 or more, whereas the NTC Cq values 
overlapped with the Cq value of the most diluted standard 
in assay II- IV (Additional file 3: Figure S2B). However, 
with respect to the multiplex assays (i.e. II and IV) exam- 
ination of the melting curve suggests that amplification of 
the NTCs is not a result of contamination, but caused by 
primer dimer formation between the telomere and the 
reference gene primers (Additional file 4: Figure S3). In 
Additional file 4: Figure S3C the NTC forms a peak at ap- 
proximately 87°C that is not observed in the standard and 
when the qPCRs are performed in singleplex reactions 
(Additional file 4: Figure S3A-B). Thus amplification of 
NTC in assays II and IV should not bias telomere length 
estimates. Assay III was performed in singleplex and thus 
for this assay amplification of the NTC may be a result of 
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contamination or primer dimers which could bias telo- 
mere length estimates. 



assays exhibited a significant correlation between log exp 
DNA and amplification efficiency. 



Amplification efficiency 

In both the telomere and the reference gene reac- 
tions, estimated amplification efficiencies were higher 
when determined from the slope of the standard curve 
compared to the estimates obtained with LinRegPCR 
(Figure 2). However, efficiency estimates obtained with 
LinRegPCR were generally less variable among batches 
within an assay and among the four different assays. 
Exceptions include assay IV, where the efficiency of the 
telomere reaction was higher than in the other three 
assays, and assay II where the efficiency of the reference 
reaction was very low and variable among batches. 

In assay I, there was a strong and highly significant 
correlation between efficiency and DNA amount in both 
the telomere and reference gene reactions (Additional 
file 5: Table S2). The correlation was strongest for the 
telomere reactions in which DNA amount per reac- 
tion explained more than 75% of the variation in 
amplification efficiencies with the most diluted stan- 
dards having higher reaction efficiencies (E = 62.7%) 
than the undiluted standard (E = 54.8%). None of the other 




Assay I 



Assay II Assay III Assay IV 




Assay I Assay II Assay III Assay IV 

Figure 2 Average amplification efficiencies and their standard 
deviation estimated by the standard curve method (dark grey) 
and the LinRegPCR program (white). A: the telomere reaction 
efficiencies. B: tlie reference gene efficiencies. 



Precision, robustness, accuracy, and resolution 

The precision, robustness and accuracy estimates obtained 
for each assay is summarized in Figure 3, Figure 4 and 
Table 2. In general, the telomere reactions were more pre- 
cise, robust and accurate than the reference gene reac- 
tions. More specifically, while the accuracy of all assays is 
high in the telomere reactions, precision is slightly lower 
in assay I relative to the other assays. In the reference gene 
reactions, accuracy varies between assays and quantitative 
methods, but only assay I has a slope that is significantly 
different from 1 (two-tailed T-test; 7=2.230, P = 0.028), 
suggesting that assay I is inaccurate under these specific 
conditions. Also, it appears that precision in assay II is 
slightly lower than in the other assays. Robustness is best 
for assay IV compared to the other assays. 

Assay I had the lowest resolution with the observed 
Log DNA amount not being significantly different be- 
tween the first three consecutive steps of the telomere 
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Figure 3 Precision of each assay quantified in terms of the 
standard deviation (SD) of Cq values estimated for each 
triplicate reaction. A: SD associated witli telomere Cq values. B: SD 
associated with reference gene Cq values. Circles denote the SD 
values estimated for each dilution series and batch and black bars 
mark the average. 



Olsen et al. BMC Genetics 2012, 13:77 
http://www.bionnedcentral.conn/1471-21 56/1 3/77 



Page 7 of 1 5 



20 1 


A 




18 - 




o 


16 - 






14 - 






^ 12 - 






^ 10 - 




o 








O 8 - 






U 4 . 




o 

O q" Q 


2 - 




o § i" 


0 - 




Assay I Assay n Assay III Assay IV 


90 1 


B 




80 - 




O 


70 - 






60 - 






g 50 - 






1 40- 




O 


o 30- 




o" 


> 






U 20 - 






10 - 




8" o § @- 


0 - 




Assay I Assay n Assay III Assay IV 


Figure 4 Robustness of each assay quantified in terms of the 


coefficient of variation (CV) of logObs estimates across batches. 


A: CV associated with telomere logObs values. B: CV associated with 


reference gene logObs values. Circles denote the CV values estimated 


for each dilution step and black bars mark the average CV for a 


particular assay. 





dilution series, and between the second and third step in 
the reference gene dilution series (Table 3). Assay II had 
low resolution between the third and forth step in the 
reference gene dilution series. Assays III and IV had ad- 
equate resolution between all steps of the dilution series. 



Finally, the influence of the precision and accuracy of 
different assays and quantitative methods on telomere 
length estimates was assessed by plotting the expected 
amount DNA from the dilution series against the ratio of 
telomere to reference gene (Figure 5). In assay I, the 
telomere-reference gene ratio was not constant for differ- 
ent dilution steps with the ratio tending to be lower than 
expected for the undiluted standards, but higher than 
expected for the intermediately diluted standards 
(Figure 5A, B). The telomere-reference gene ratio is more 
or less constant in assay II, but estimates are associated 
with a high degree of variability (Figure 5C, D). In assay 
III, the telomere-reference gene ratio appeared to be con- 
stant across dilution steps, except for the most diluted 
sample in which the ratio was slightly overestimated 
(Figure 5E, F). In assay IV, the PfafQ method was less pre- 
cise than the Ruijter method and appeared to slightly 
overestimate the telomere-reference gene ratio in the 
standards with the lowest and highest amount of DNA 
(Figure 5G, H). Except for this, the ratio was constant 
across dilution steps and precision was high, particularly 
at Log exp DNA of 1.56 and 1.26, corresponding to 36 
and 18 ng DNA per reaction, respectively. 

Humpback whale telomere length estimation 
Precision and resolution of telomere length estimates 

The number of outliers was low in assay I and IV, but rela- 
tively high in assay II and III (Table 4). These latter assays 
were also associated with the lowest precision of individ- 
ual telomere length estimates (i.e. a high CV) and the 
lowest resolution across the range of telomere lengths 
(i.e. a high SD/(dMinMax)). In assay IV, CV ranged from 
0.6% to 30.8% and averaged at approximately 9% depend- 
ing on quantitative method. Also the average standard de- 
viation of telomere lengths was within 4.9-6.2% of the 
total range of telomere lengths, which suggest a 2-4 times 



Table 2 Precision and accuracy of the telomere and reference gene reactions quantified in terms of the standard 
deviation of Cq values (Cq SD), as well as the correlation coefficient (R^) and the slope of the regression line between 
expected and observed amount log DNA as illustrated in Figure 1 



Target 


Assay 


Cq SD 


PfaffI 






Ruijter 








Slope 


SD 


R^ 


Slope 


SD 


Telomere 


1 


0.330 


95.6 


1.00 


0.021 


95.8 


1.00 


0.021 




II 


0.171 


98.5 


0.99 


0.019 


99.0 


0.99 


0.016 




III 


0.115 


99.4 


1.00 


0.012 


99.4 


1.00 


0.012 




IV 


0.116 


99.2 


1.00 


0.012 


99.5 


1.01 


0.010 


Reference 


1 


0.423 


95.7 


1.06 


0.027 


95.6 


1.00 


0.021 




II 


0.476 


92.0 


0.98 


0.045 


93.4 


1.02 


0.044 




III 


0.264 


97.0 


0.99 


0.027 


93.4 


0.97 


0.039 




IV 


0.196 


96.4 


0.97 


0.033 


97.6 


1.02 


0.022 
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Table 3 Resolution between consecutive steps of the dilution series determined by the P-values of a two-tailed T-test 
for comparing averages 



Assay 


Target 


Batches 


Exp amount DNA (ng/reaction) 












72.0-36.0 


36.0-18.0 


18.0-9.0 


9.0-4.5 


1 


Tel 


7 


0.103 


0.051 


0.013 


0.005 




Ref 


7 


0.029 


0.052 


0.024 


0.022 


IV 


Tel 


4 


0.014 


0.001 


0.000 


0.002 




Ref 


4 


0.010 


0.009 


0.007 


0.012 


Assay 


Target 


Batches 


Exp amount DNA (ng/reaction) 












60.0-20.0 


20.0-6.7 


6.7-2.2 




II 


Tel 


4 


0.013 


0.007 


0.002 






Ref 


4 


0.019 


0.018 


0.254 




III 


Tel 


4 


0.000 


0.001 


0.000 






Ref 


4 


0.032 


0.002 


0.020 





Bold values are non-significant at the 5% level and indicate that there is no difference between the amounts of DNA observed at two consecutive dilution steps. 



higher (better) resolution than the other assays, depending 
on the quantitative method used. 

Effect of quantitative method 

For each assay, the telomere length of the "unknowns" 
was estimated using four quantitative approaches differ- 
ing with respect to their estimation of amplification effi- 
ciency, their inter-batch normalization procedure, and 
how the raw fluorescence data was baseline corrected. 

In assay I and IV, the correlation between telomere 
length estimates obtained with different quantitative meth- 
ods was generally good with = 81.9-95.6% and = 87.7- 
96.6% for assays I and IV, respectively (Additional file 6: 
Table S3). In contrast, telomere length estimates based on 
the data generated in assay II and III exhibited greater vari- 
ation across quantification methods, suggesting that the 
results generated from these assays should be inter- 
preted with caution. This was most prominent for the 
assay II data in which telomere lengths correlated 
with as little as R^ = 11.0% when estimated by the 
methods of Livak and Schmittigen (2001) and Ruijter 
(2009), respectively. If we assume that assay IV pro- 
vide the most reliable results it appears that although 
the estimated amplification efficiencies differed by as 
much as 20% between the Pfaffl and Ruijter methods 
(Figure 2) the correlation between telomere lengths 
estimates was still high (R^ = 96.6%), and similar or margin- 
ally better than telomere length estimates obtained using 
different baseline correction methods (i.e. R^ = 91.9%). 

Discussion 

The qPCR method for telomere length estimation has 
been criticized for being difficult to optimize, extremely 
sensitive to technical errors, and for being misused in 
assessments of telomere dynamics in model and non- 



model species [30-32]. This criticism coincides with an in- 
creasing awareness of the requirement and wide adoption 
of a general set of guidelines for experimental, analytical 
and reporting steps of qPCR in general [26-28]. Our study 
indicate that this criticism is not unjust, but in doing so 
also identifies some of the factors that may cause variation 
in qPCR experiments and exemplifies a series of quality 
controls that may aid in detecting suboptimal data 
(Table 5). Importantly, we show that it is possible to ob- 
tain high precision and accuracy in relative telomere 
length estimates by qPCR, but it requires extensive 
optimization and quality control at all experimental and 
analytical steps. 

Assay evaluation 
Assay I 

The performance of assay I and the other three assays is 
summarized in Table 5. To judge from the efficiency of 
telomere and reference gene reactions and the R^ of the 
standard curves, assay I generally performed well. How- 
ever, closer examination suggests that, under the given cir- 
cumstances, this assay is associated with moderate levels 
of precision and a resulting low resolution between dilu- 
tion steps. In addition, the efficiency estimated with the 
LinRegPCR software tended to vary across dilution steps, 
with the most diluted standards having significantly higher 
efficiency than the undiluted standards. The correlation 
between DNA amount and efficiency was stronger for 
telomere than the reference gene, indicating that telomere 
length may be underestimated for individuals with long 
telomeres (i.e. high amount of telomere DNA). This suspi- 
cion was strengthened by the observation that the ratio 
between telomere and reference gene was lower at high 
amounts of DNA relative to intermediate amounts. 
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Figure 5 Precision and accuracy of the ratio between telomere and reference in the standards estimated with the PfaffI method (A, C, E, G) 
or Ruijter method (B, D, F, H) AB: assay I; CD: assay II, EF: assay III; GH: assay IV. 



Assay II 

Assay II was the worst performing of the four assays 
with low to moderate and highly variable amplification 
efficiencies. This was reflected in the low precision and 
accuracy of the reference gene reactions and the 
telomere-reference gene ratio, the high fraction of out- 
liers, the relatively low resolution among individual telo- 
mere length estimates, as well as the correlation in 
telomere lengths between different quantitative methods 
and with the other assays. Interestingly, the telomere 



reactions were found to have high precision, accuracy 
and resolution despite the fact that the NTCs indi- 
cated the potential presence of contamination or primer 
dimers. 

Assay III 

In assay III amplification efficiencies were consistent 
across batches and precision, accuracy, and resolution 
moderate to high in both the telomere and reference gene 
reactions. However, the Cq of the telomere and reference 
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Table 4 Basic characteristics of telomere lengths in humpback whales determined by different assays and quantitative 
methods 









T/S 








SD 






cv% 








Assay 


Method 


N 


Av 


Min 


Max 


dMinMax 


Av 


Min 


Max 


Av 


Min 


Max 


SD/(dMinMax)% 


1 


PfaffI LR 


58 


1.05 


0.38 


1.89 


1.52 


0.16 


0.01 


0.71 


19.0 


1.8 


52.0 


10.9 




PfaffI RG 


60 


1.29 


0.38 


2.42 


2.04 


0.22 


0.01 


0.47 


15.8 


1.6 


47.8 


11.0 




Ruijter 


60 


1.36 


0.36 


2.60 


2.24 


0.25 


0.02 


0.81 


17.1 


4.7 


41.2 


11.2 




ddCq 


58 


1.19 


0.29 


2.17 


1.88 


0.23 


0.01 


0.80 


17.9 


1.6 


49.5 


12.4 


II 


PfaffI LR 


54 


0.84 


0.15 


1.73 


1.57 


0.19 


0.01 


0.71 


21.4 


3.3 


72.1 


12.1 




PfaffI RG 


49 


0.77 


0.15 


1.80 


1.65 


0.31 


0.03 


1.86 


35.5 


6.0 


131.0 


19.1 




Ruijter 


51 


0.73 


0.15 


1.51 


1.36 


0.16 


0.01 


0.58 


21.8 


3.3 


72.1 


11.9 




ddCq 


54 


3.38 


0.07 


20.46 


20.39 


1.21 


0.02 


9.76 


30.3 


3.6 


93.2 


5.9 


III 


PfaffI LR 


54 


0.80 


0.24 


1.39 


1.15 


0.20 


0.00 


1.22 


25.1 


0.2 


89.1 


17.1 




PfaffI RG 


55 


0.87 


0.19 


1.62 


1.43 


0.31 


0.04 


1.12 


49.6 


4.5 


372.4 


21.6 




Ruijter 


56 


0.82 


0.17 


1.40 


1.23 


0.21 


0.01 


1.25 


26.4 


0.6 


89.1 


17.3 




ddCq 


54 


0.73 


0.19 


1.94 


1.75 


0.22 


0.00 


1.39 


30.6 


0.1 


111.9 


12.3 


IV 


PfaffI LR 


59 


1.58 


0.44 


2.87 


2.43 


0.14 


0.01 


0.42 


8.7 


1.3 


29.6 


5.7 




PfaffI RG 


59 


1.61 


0.45 


3.15 


2.70 


0.14 


0.02 


0.51 


9.0 


0.6 


30.8 


5.3 




Ruijter 


59 


1.54 


0.45 


2.62 


2.16 


0.13 


0.01 


0.44 


8.8 


1.3 


29.6 


6.2 




ddCq 


60 


1.53 


0.45 


2.93 


2.48 


0.12 


0.01 


0.31 


8.2 


1.3 


27.2 


4.9 



N = number of samples after outlier removal; T/S = Telomere lengtli; Av = average; Min = minimum T/S; Max = maximum T/S; dMinMax = difference between 
minimum and maximum T/S; SD = standard deviation of T/S estimates; CV% = the coefficient of variation of T/S estimates in percent; SD/(dMinMax) = the ratio in 
percent between SD and the difference between minimum and maximum T/S. PfaffI LR = the PfaffI method with baseline corrected in LinRegPCR; Ruijter = the 
Ruijter method; ddCq = the comparative Cq method; PfaffI RG = the PfaffI method with baseline corrected in Rotor-Gene or ABI software. See text for details. 



gene NTCs either overlapped or were proximate to the 
Cq of the most diluted standards, indicating the presence 
of contamination or primer-dimers. Moreover, the assay 
was characterized by a relatively high number of outliers, 
high CVs and relatively low resolution among individual 
telomere length estimates, and only intermediate correl- 
ation between the telomere length estimates obtained by 
different quantitative methods. This serves to illustrate 
that high amplification efficiencies and a good fit of the 
standard curve is not a guarantee that an assay will pro- 
duce reliable results. 

Assay IV 

This assay was the best performing of the four assays, 
having moderate to high precision, accuracy and reso- 
lution both when examining the telomere and reference 
genes in isolation as well as for estimated telomere 
lengths. This was also reflected in a high correlation be- 
tween telomere lengths estimates obtained with different 
quantitative methods. Assay IV thus fulfill our criteria 
for a good assay. 

Factors affecting telomere length estimates 

Our evaluation of assay performance serves to identify 
some of the factors that may affect the precision and ac- 
curacy of qPCR in general, and telomere length estimation 
in specific. 



Telomere and reference gene primers 

A significant obstacle to qPCR amplification of telomeres 
was the difficulty associated with designing primers that 
hybridize to the telomere repeats without forming primer 
dimer derived products. Seemingly, this problem was 
overcome by Cawthon who designed a set of primers for 
telomere amplification in humans in which build-in mis- 
matches ensures higher primer specificity to the telomere 
repeat relative to the opposite primer [24,25,52]. As the 
telomere repeat is similar for all vertebrates [2] these pri- 
mers should be generic. We found that the telomere 
primers used in assay II-IV amplified with high preci- 
sion and accuracy, whereas precision was lower for 
the primers used in assay I. Moreover, in this assay 
amplification efficiencies for the telomere reactions 
appeared to be affected by the amount of DNA in the 
reaction. The telomere primers used in assay I differ 
from those of assay II, III and IV in that the latter 
amplify a product of fixed length with a well-defined 
melting profile, allowing for subsequent evaluation of 
each reaction. Hence, the telomere primers described 
in Cawthon [24] appear most suitable for telomere length 
estimation - also in situations where limited access to 
multiplex qPCR platforms and software necessitate the 
use of the singleplex method. 

The interspecific use of qPCR primers may be more 
problematic for the reference gene where potential 
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Table 5 Summary of quality control steps and overall assay performance for the telomere (T), reference (S), and 
telomere length estimates (T/S) 



Criteria 


Target 


Assay 1 


Assay II 


Assay III 


Assay IV 


Figure/Table 


Serial dilution series 














LDR 


T 


Borderline 


Within range 


Within range 


Within range 


Figure 2 




S 


Borderline 


Within range 


Within range 


Within range 




NTC 


T 


OK 


Borderline 


Borderline 


OK 


Figures 3, 4 




S 


OK 


Overlap* 


Overlap 


Overlap* 




Magnitude E 


T 


Intermediate 


Intermediate 


Intermediate 


High 


Figure 1 




S 


Intermediate 


Low 


Intermediate 


Intermediate 




Inter botch E 


T 


Constant 


Variable 


Constant 


Constant 


Figure 1 




S 


Constant 


Variable 


Constant 


Constant 




Dilution E 


T 


Variable E 


No change 


No change 


No change 


Table 3 




S 


Variable E 


No change 


No change 


No change 




Precision 


T 


Intermediate 


High 


High 


High 


Figure 5, Table 4 




S 


Intermediate 


Low 


Moderate 


High 




Accuracy 


T 


High 


High 


High 


High 


Table 4 




S 


High 


Intermediate 


High 


Intermediate 




Resolution 


T 


Low 


High 


High 


High 


Table 5 




S 


Intermediate 


Intermediate 


High 


High 




Precision 


T/S 


Intermediate 


Low 


Intermediate 


Intermediate 


Additional file 3: Figure S2 


Accuracy 


T/S 


Low 


Low 


High 


High 


Additional file 3: Figure S2 


"Unknown" samples 














Outliers 


T/S 


Few 


Many 


Many 


Few 


Additional file 1: Table SI 


Resolution 


T/S 


Intermediate 


Low 


Low 


High 


Additional file 1: Table SI 


Quantitative nnethod 


T/S 


High 


Low 


Intermediate 


High 


Additional file 5: Table S2 


Inter-assay correlation 


T/S 


Low 


Low 


Low 


Intermediate 


Additional file 4: Figure S3 


Assay evaluation 




Intermediate 


Worst 


Intermediate 


Best 





interspecific differences in occurrence of e.g. SNPs, splice 
variants, and pseudo genes can substantially affect primer 
specificity and hence amplification efficiency. Primers 
with high efficiency have higher precision (less vari- 
ability) than primers with lower efficiency [53]. The 
difficulties associated with obtaining good reference 
gene primers is illustrated by the fact that only two 
of the initial set of 8 primers passed quality control and 
that the performance of one of these — the assay I refer- 
ence gene primers — later turned out to be significantly 
affected by DNA amount. Hence in addition to initial 
optimization and quality control it is essential to perform 
subsequent evaluation once data has been generated. 

Moreover, because the single copy reference genes typic- 
ally amplif)A at a higher cycle number than the target 
genes, and because high Cq values usually are associated 
with larger variability [47], reference gene reactions may 
per default introduce a certain degree of variability into 
telomere length estimates. In the present study, the Cqs 
of the reference gene reactions were typically 15-20 cycles 



higher than the telomere reactions, and the precision of 
the reference gene was always lower than that of the telo- 
mere. Cycle number may be lowered, and hence precision 
increased, by increasing the amount of template DNA per 
reaction. However, the optimal amount likely differs for 
different assays, even when the same primers are used. For 
example, while precision in assay III appear to be highest 
at 20 ng template DNA per reaction, the precision of assay 
IV may, in retrospective, have been further increased by 
increasing the amount of template DNA from 20 ng to 
36 ng per reaction. Moreover, since a further increase to 
72 ng template DNA per reaction causes a reduction in 
precision, the optimal range of DNA template amount for 
telomere length measurement may be rather narrow. Con- 
sequently, we suggest that optimal amount of template 
DNA is identified by first using a serial dilution series with 
a high fold difference between dilution steps and subse- 
quently exploring the optimal range in more detail by a 
more narrow serial dilution series spanning a lower fold 
difference in amount template DNA. 
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To ensure high precision and reliability of results, Bustin 
and co-authors [26] recommended the use of at least three 
reference genes and that the performance of these are 
tested using e.g. geNorm [54]. The present telomere 
length estimations are based on normalization to a single 
reference gene, and although the reference gene primer 
pair used in assay II-IV has been validated and widely used 
in molecular studies of cetaceans [38], future applications 
of the qPCR approach to telomere length measurement 
may benefit from the inclusion of more reference genes. 

Experimental design 

Experimental design includes the choice of qPCR plat- 
form and consumables, whether to perform the telomere 
and reference gene reactions in singleplex or multiplex, 
which reagents to use, and the choice of standards for 
standard curves and inter-ba tch calibration. 

Platforms and their specific consumables may differ in 
their abilities to adjust the temperature of the qPCR reac- 
tion and their accuracy in measuring the fluorescence 
signal from each reaction. Here, assay I was conducted 
using an ABI PRISM™ 7000 Sequence Detection System 
while assays II-IV were performed in a QIAGEN Rotor- 
Gene Q (former Corbett Rotor-Gene 6000) qPCR cycler. 
Preliminary results suggest that qPCRs performed in the 
ABI platform are subject to edge-effects in which 
temperature differences across the heating block cause 
variation in reaction amplification efficiencies (Olsen MT, 
in prep). Although we attempted to reduce this effect by 
not using the wells along the edge of the plate minor 
variations in temperature across the heating block may 
have contributed to reduce the precision of assay I. 

Multiplexing may increase the precision of telomere 
length estimates by removing variability between the 
telomere and reference gene reactions caused by e.g. 
pipetting [24]. However if not sufficiently optimized, 
multiplexing can lead to the formation of primer dimers 
and/or cause inhibition of one or both reactions as a 
consequence of competition for reagents. The low amp- 
lification efficiencies of the reference gene reactions in 
assay II and the resulting variability of telomere length 
estimates produced with this assay is likely caused by 
such competition between the telomere and reference 
gene primers. If so, this inhibition is likely to be caused 
by suboptimal concentrations of reagents and/or primers 
in that all other aspects of assay II were similar to assay 
IV which worked fine. 

The use of standard curves to estimate telomere 
lengths and adjust for inter-batch variation relies on the 
assumption that standards have similar amplification 
efficiencies as the unknowns. We sought to reduce po- 
tential differences in efficiency by constructing standard 
curves from a pool of DNA extracted by the same 
method and from the same tissue and species as the 



unknowns. Further, the duration of the entire assay was 
short (5-21 days) and the standards kept in fridge in 
order to minimize potential DNA degradation caused by 
freeze-thawing and/or prolonged storage. Similarity in 
amplification efficiencies was controlled by estimating 
individual efficiencies with the LinRegPCR software. Pre- 
liminary tests suggest that standards retained their preci- 
sion and accuracy even after more than two months of 
storage in the fridge. 

Finally, it is important to realize that all qPCR experi- 
ments are associated with a certain degree of internal 
stochasticity and that some inter-batch and inter-assay 
variation is unavoidable because different batches and 
assay are carried out at different times and/or by differ- 
ent persons [28]. 

Data processing and analysis 

In qPCR, there are two aspects of amplification effi- 
ciency; one is the potential influence of primer efficiency 
on accuracy and precision as discussed above, and the 
other the influence of inferred amplification efficiency 
on data analysis. There is currently little consensus 
about the best method to determine the true amplifica- 
tion efficiency of a reaction, and new methods are con- 
tinuously reported which claim superiority to existing 
methods [45,46,51,54-58]. For assay IV, our comparison 
of telomere length estimates obtained with different 
methods show that even though efficiency estimates 
obtained from the standard curve (the Pfaffl method) 
and LinRegPCR (the Ruijter method) differed by 10-40%, 
the resulting telomere length estimates varied by less 
than 5%. Thus, if a given primer set/assay has documen- 
ted high precision and meets the other quality criteria, 
the inferred efficiency of this primer pair may be of less 
concern in telomere length measurements, as also sug- 
gested by Regier and Frey [59]. Indeed, the comparisons 
between the ddCq, Ruijter, and Pfaffl methods show that 
disregarding efficiency all together (the ddCq method) 
only increased inter-method variation in telomere length 
estimates to 10%, approximately the same level of vari- 
ation introduced by using different baseline correction 
methods (Pfaffl LR vs Pfaffl RG). 

Inappropriate technology 

Finally, the choice of technology for telomere length 
measurement may significantly affect estimates. The two 
most common methods for telomere length estimation 
are the telomere restriction fragment (TRF) and qPCR 
methods. We chose the latter because is it supposedly 
quicker, more sensitive, and less technical than the TRF 
method and thus allows for a higher throughput. The 
higher throughput also serves to reduce the costs - par- 
ticularly if the qPCR multiplex method is used. More- 
over, the qPCR method is associated with less stringent 
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requirements on DNA amount and quality [60], which is 
crucial when determining telomere length in free- 
ranging species such as humpback whales where sam- 
pling is remotely and skin tissue the only source of 
DNA. Recently, Kimura and Aviv [61] presented a new 
method for telomere length estimation based on dot blot 
analysis and the SYBR Dx DNA Blot Stain. This method 
does not require normalization to a reference gene and 
is performed in "multiplex", reducing the potential vari- 
ation caused by pipetting errors. Hence, the dot blot 
method may prove to be a less error-prone complement 
or alternative to the qPCR method, although its applic- 
ability for telomere length estimation in non-model spe- 
cies is still to be assessed. 

Conclusion 

Telomeres hold great promise as markers of individual life 
history, health, and fitness. The above should serve to il- 
lustrate some of the potentials and pitfalls associated with 
telomere length measurement by the qPCR method. As 
noted by Kimura and Aviv [61], the measurement of true 
telomere length is a standard that can only be approached 
asymptotically and it will never be possible to fully remove 
experimental, biological and stochastic errors. However, 
by recognizing its limitations and aiming at reducing 
error by careful experimental design and rigorous 
quality control at all stages of analysis, the qPCR method 
may nevertheless be good to obtain relative trends, espe- 
cially for non-model species where experimental alterna- 
tives are lacking. It is our hope that the analyses and 
quality control summarized in Table 5 may serve as a pre- 
liminary set of guidelines for a more rigorous quality 
control of telomere length estimates by qPCR, as well as 
inspire to the continuous use of telomeres as a proxy for 
biological age and life histories in model and non-model 
species. 

Appendix 

Quantification with the Pfaffl method [48] 

1. A standard curve is produced by plotting Log DNA 

amount against Cq to give a linear regression line. 
The line is described by y = ax + b, where y = Cq; 
X = Log expected DNA amount; a = the slope; b = 
the y-intercept 

2. Amplification efficiency of the standards are estimated 

from the standard curve as E = 10^(— 1/slope) and 
assumed to be similar to the efficiency of the 
"unknown" samples. 
3. The observed amount of telomere is estimated as 
T = (Cq calibrator — Cq sample) which 
corresponds to T = 

10^ ((Cq calibrator — Cq sample) /slope) or expressed 
in terms of the linear regression components T = 



10^ ((b — y)/a). The observed amount of reference 
gene (S) is estimated in the same way. 

4. Relative telomere length (T/S) is estimated as the ratio 

between amount telomere (T) and reference gene (S). 

Quantification with the Ruijter method [46] 

5. The starting concentration (NO) is determined by 

NO = Nt/(E^Cq), where Nt = is the threshold line 
and E = the average amplification efficiency of all 
reactions. 

6. We converted estimates of NO into observed amount 

of telomere (T) using T = 

10"^ ((log NO — intercept) /slope) which corresponds 
to T = 10^ ((y — b)/a) where a and b is the slope 
and intercept of the linear regression between 
expected Log DNA amount and Log NO. The 
observed amount of reference gene (S) is estimated 
in the same way. 

7. Relative telomere length (T/S) is estimated as the ratio 

between amount telomere (T) and reference gene (S). 

Grubbs' outlier detection [50] 

8. Grubbs' outlier detection test in which G = 

|(Cq — mean Cq)/SD|, where G is the test-statistic 
and SD the standard deviation. G > 5% are included 
in the analysis; G = 1-5% are defined as stragglers but 
kept in the analysis; G < 1% are outliers and 
discarded. 

Additional files 



Additional file 1: Table SI. Reference gene primers tested. BA = Berube 

and Aguilar [38]; CW 2006 = Callicot and Womack [52]; RC 

2009 = Cawthon [24]. Note that all reference gene primers used in assays 

ll-IV had a CG-clamp (CGGCGGCGGGCGGCGCGGGCTGGGCGG) attached 

to increase annealing temperature as described in Cawthon (2009). 

* Caution; this primer pair was used in assay I in which primer efficiency 

was found to correlate with DNA concentration. 

Additional file 2: Figure SI. Example standard curve for the telomere 
portion of assay II. In A. it is clear that amplification efficiency decrease in 
reactions with log DNA above 1.78 (20 ng DNA/reaction) causing 
deviation from linearity of the standard curve. As illustrated in B. the the 
linear dynamic range of the telomere primer is within log 1.778-0.347 
DNA, corresponding to 60-2.2 ng DNA per reaction. Note how the 
difference in the slopes of the standard curves in A and B affects the 
estimated amplification efficiency. In A. the amplification efficiency is 
E = 2.050 (105.0%) whereas it is E= 1.787 (78.7%) in B. Relationship 
between and observed "variable" (Ct, NO, fluorescence) against known 
concentration in a dilution series. 

Additional file 3: Figure S2. Amplification of non template controls 
(NTC) in the four assays relative to the most diluted standard in the serial 
dilution series in telomere (A) and reference gene (B). Circles denote the 
Cq value of individual NTC reactions and bars mark the average Cq and 
standard deviation of the most diluted standards. Note that the overlap 
between NTC and standards in the reference gene reactions of multiplex 
assay II and IV result from telomere-reference gene primer dimers as 
shown in Supplementary Figure 3A-C. 
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Additional file 4: Figure S3. Representative melting curves for the 
standard and NTC reactions. Black line is the most diluted standard and 
stippled line the NTC. A: the telomere reaction in assay III where the NTC 
starts amplifying a few cycles after the most diluted standard. B: the 
reference gene reaction in assay III where the NTC and standard overlap. 
C: the multiplex reaction in assay II in which telomere-reference gene 
primer dimers are producing a peak at approximately 87°C in the NTC 
but not in the standard. 

Additional file 5: Table S2. Linear regression and correlation between 
expected log amount DNA in standards and their amplification efficiencies 
as estimated in LinRegPCR. Bold values are significant at the 5% level. 
Slope = slope of the regression line; I ntercept = intercept of regression line 
reflecting the hypothetical maximum efficiency in percent. 

Additional file 6: Table S3. Percent correlation (R^) between telomere 
length estimates obtained by different quantitative methods. PfaffI 
LR = the PfaffI method with baseline corrected in LinRegPCR; Ruijter = the 
Ruijter method; ddCq = the comparative Cq method; PfaffI RG = the PfaffI 
method with baseline corrected in Rotor-Gene or ABI software. See text 
for details. 
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